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ABSTRACT  and  CONTENTS 


This  is  the  reference  manual  for  the  BCC  500  System  Programming 
Language  (SPL).  The  syntax  and  semantics  of  the  language  are  defined. 
This  reference  manual  is  an  edited  and  updated  version  of  a  working  paper 
written  by  Butler  W.  Lampson. 
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1 .  1  I  itroduction 

y 

The  BCC-500  Systems  Programming  Language  (SPL)  was  designed 
for  use  in  systems  development,  translator  writing  and  high  level 
sub-system  development.  It  is  sufficiently  flexible  and  generates 
sufficiently  efficient  object  code  to  replace  assembly  language  in 
almost  all  situations.  SPL  is  highly  interactive,  simple  to  use  from 
a  terminal  and  helpful  in  error  situations. 

This  reference  manual  defines  the  syntax  and  the  semantics  of 
the  language.  Details  about  the  command  language,  the  editor  and 
debugger  will  appear  in  another  manual. 
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2.  Meta  Language 

A  syntax  equation  describes  the  form  of  some  construction  in 
a  language.  For  example,  the  following  equations  describe 
simple  arithmetic  expressions. 

expression  -  ["+/"-"]  term  $(("+7"-")  term); 
term  =  primary  $(("*7"/")  primary  ); 

primary  =  symbol  /  number  /  "("  expression 
The  first  equation  reads  "an  expression  has  the  form  of  an  optional  plus 
or  minus  followed  by  a  term  followed  by  an  arbitrary  number  (including  0) 
of  the  constructs  (denoted  by  either  a  plus  or  minus  sign  followed 

by  a  term."  The  last  reads  "a  primary  has  the  form  of  a  symbol  or  a 
number  or  a  literal  left  parenthesis  followed  by  an  expression  followed 
by  a  right  parenthesis."  Symbols  such  as  'primary',  'term'  and  'expression 
are  defined  above  and  are  called  non-terminal  symbols.  'Symbol'  and 
•number'  are  not  defined  above  and  are,  therefore,  called  terminal  symbols. 
The  construct  "("  is  an  example  of  a  literal. 
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2*1  Syntax  for  Meta  Language  Equations 


equation 

=  symbol  "="  alternation 

alternation 

=  catenation  ${"/"  catenation); 

catenation 

=  term  $term; 

term 

=  primary  /  optional  /  repetition  /  negation; 

primary 

=  symbol  /  literal  /  "{"  alternation  ")"  / 

"<"  text  ">"; 

optional 

=  "["  alternation 

repeti ti on 

=  [integer]  "$"  [integer]  primary; 

negation 

=  primary  primary; 

The  alternation  indicates  that  any  alternative  is  legal;  a  catenation 
indicates  that  the  terms  follow  each  other.  The  last  alternative  of 

the  definition  of 

a  primary  is  included  to  allow  an  "escape"  to  normal 

English  text.  Naturally  the  text  may  not  include  a  ">"  character. 
This  convention  should  be  avoided  if  possible. 


An  'optional'  indicates  zero  or  one  occurrence  of  the  alternatives 
enclosed  in  brackets.  The  meaning  of  the  four  kinds  of  repetitions  are 


Form 


Meaning 


$  something 
1$  something 
$6  something 
2$7  something 


arbitrary  number  of  'something's' 
one  or  more  'something's' 
zero  to  six  'something's' 
two  to  seven  'something's' 


The  negation  construct  is  used  to  make  an  exception.  It  translates 
"any  occu. rence  of  the  first  primary  except  for  an  occcurrence  of  the 


second  primary." 
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2-2  Syntax  of  Meta  Language  Tokens 

Many  languages  are  built  out  of  small  units,  called  tokens.  The  tokens 
of  the  metalanguage  are  symbols,  integers,  literals  and  certain  punctuation 
characters  such  as  'T  and  (see  below).  In  such  languages  blanks 
and  carriage  returns  may  appear  between  tokens  without  affecting  the 
meaning  of  the  information.  The  usual  rule  is  that  blanks  and  carriage 
returns  delimit  (or  separate)  tokens,  but  are  otherwise  ignored. 

Unless  the  contrary  is  explicitly  stated  in  our  documents,  the  reader 
may  assume  the  treatment  of  blanks  is  as  described  above. 

These  equations  define  the  tokens: 

symbol  =  word  $(":"  word) ; 
word  =  letter  $ (letter  /  digit); 
integer  =  digit  $digit; 

literal  =  ""  $charl  ""  /  " $char2  . ; 

charl  =  character  -  ; 

char2  =  character  - 

A  symbol  is  a  sequence  of  words  separated  with  colons.  A  word  is  at  least 
one  letter  optionally  followed  by  letters  or  digit.  An  integer  is 
represented  in  the  normal  way.  A  literal  is  a  string  of  either  charl's 
enclosed  by  double  quotes  or  char2‘s  enclosed  by  single  quotes.  The 
first  form  lo  preferred,  'charl'  is  any  character  except  double  quote: 
'char2',  any  except  single  quote.  Note  that  "?"  is  not  the  same 
as  The  first  form  implies  that  the  "?"  and  may  be  separated  by 

blanks,  while  the  second  requires  that  the  two  characters  appear 
consecutively  with  no  intervening  blanks. 
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These  three  equations  define  the  basic  set  of  characters  for  defining 


tokens. 


character  =  letter  /  digit; 


letter 


digit 


"A"/"B7"C'7"D'7''E"/"F'7"G"/"H'7"r7"J7"K"/"L'7"M7 
I'N" /"o" /”P" /"Q" /"R” /"S” /"T" /"U" /"V" /"W" /"X" /"Y" /"Z"  / 
"a"/"b7"c'7"d7"e'7"f'7"g'7''h'7"i7"j7"k'7"r7"m'7 

"n'7"o7"p7"q7"r7"s'7"t'7"u7"v7"w7"x7"y7"27 

II0II  yiii  ly  II2II /"3'' /''4" /"5" /"6" /"7'7"8" /"9" ; 
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3.  Scopes  and  Program  Format 

An  SPL  program  is  organized  into  blocks.  A  block  begins 
with  a  PROGRAM  or  COMMON  statement  and  ends  with  an  END 
statement.  It  has  a  name  which  is  given  in  its  initial 
statement.  Block  names  must  be  unique  over  the  entire 
program.  Thus  the  general  format  of  a  program  is: 

=  $block; 

=  common: block  /  program: bl ock ; 
=  "COMMON"  identifier  ";" 
block:head 
end:statement 
=  "PROGRAM"  identifier  "j" 
block:head 
$($(label  ":") 
action: statement  ";") 
end: statement 
=  $( include: statement  ";") 

$(al location.’statement  ";") 
$(declare: statement 

allocation:statement  =  fixed:statement  / 

origin: statement; 

label  =  identifier; 


program 

block 

common: block 

program:block 


block:head 


identifier 


=  name; 


n 


Thus,  statements  must  occur  in  a  block  In  the  order: 

PROGRAM  or  COMMON  statement 
include: statements 
al 1 ocati on : s  tatements 
dec1are:statements 
action: statements 
end; statement 

A  commcn:b1ock  must  precede  any  blocks  which  INCLUDE  it. 

3.1  Lexical  format 

Every  statement  ends  with  a  semi-colon. 

Carriage  returns  and  blanks  are  treated  according  to 
the  following  rules: 

1)  inside  string  constants  or  character  constants 
blanks  are  treated  like  ordinary  characters. 
Carriage  returns  are  illegal  in  string  and 
character  constants  (unless  written  with  the 
escape  convention). 

2)  elsewhere  a  string  of  carriage  returns  and  blanks 
is  equivalent  to  a  single  blank. 

3)  a  blank  may  appear  anywhere  except  in  the  middle 
of  a  token.  Tokens  include  names,  reserved  words, 
constants,  the  sequences  "<="  ">="  "**"  "//"  that 
are  SPL  two  character  operators. 
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To  summarize  these  rules  somewhat  sloppily  we  say  that  carriage 
returns  and  blanks  are  ignored  except  in  string  constants, 
names,  and  reserved  words. 

A  comment  has  the  form: 

conment  =  <carrtage  return>  <arbitrary 
st  ’ing  of  characters  not  including 
carriage  return>  <carriage  return>  / 

"/*"  <arbitrary  string  of  characters 
not  including  "*/"  or  carriage  return> 
(''*/"  /  <carriage  return>); 

The  first  form  of  comments  is  exactly  equivalent  to  a 
carriage  return.  The  second  form  is  equivalent  to  a  blank 
if  it  ends  with  a  carriage  return  if  it  ends  with  a 
carriage  return;  the  difference  is  apparent  only  if  it  is 
immediately  followed  by 

Note  that  a  multi -line  comment  must  have  an  *  or  /*  at  the 
start  of  each  line. 

3.2  Scopes 

Each  variable  is  declared  in  some  block  and  is  said  to  be 
local  to  that  block.  The  same  ide  tifier  may  refer  to  two 
different  variables  which  are  loc. I  to  different  blocks. 

The  variable  name  together  with  the  block  name,  however, 
is  sufficient  to  identify  the  variable  uniquely.  A 
variable  is  said  to  be  LOCAL  in  scope  if  it  is  local  to  a 
program  block,  COMMON  if  it  is  local  to  a  common  block. 
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Function  names  (i.e.  names  which  appear  inmediately  after 
FUNCTION  or  ENTRY)  are  GLOBAL  in  scope,  however. 

A  variable  may  be  referenced  only  in  a  block  in  which  it  is 
defined.  Any  variable  is  defined  in  the  block  to  which  it  is 
local.  Suppose  that  block  C  includes  COMMON  blocks  Bi 
{i=l,...,n)  in  that  order.  Then  a  variable  defined  in  Bj 
is  also  defined  in  C  unless  it  is  local  to  C  or  defined  in 
Bi ,  i>j.  A  block  includes  Bi  if  Bi  appears  in  the  identifier: list 
of  an  include: statement  in  the  block. 

include:statement  =  "INCLUDE"  identifier:list; 
identifier:list  =  identifier  $(","  identifier) ; 

All  GLOBAL  variables  are  considered  to  be  defined  in  a  GLOBAL 
COMMON  block  which  is  considered  to  be  included  in  every  block. 

The  effect  of  this  convention  is  that  declarations  in 
COMMON  blocks  can  be  overridden  by  other  declarations  nearer 
the  point  of  use.  Exception:  a  MACRO  name  cannot  be  overridden. 
Note  that  if  B  includes  A  and  C  includes  B,  then  the  variables 
local  to  A  are  defined  in  C  (unless  variables  of  the  same 
name  are  local  to  B  or  C).  A  declaration  overriding  an  INCLUDE 
must  occur  before  any  reference  to  the  variable  involved. 

See  figure  1.  for  an  illustration. 
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4.  Allocation 


SPL  has  a  considerable  amount  of  machinery  for  controlling 
the  allocation  of  storage  for  programs  and  data.  Much  of 
this  machinery  is  of  limited  interest,  but  a  few  parts  of 
it  are  important  to  nearly  all  programmers.  This  section 
discusses  the  topics  of  general  interest  first,  before 
going  on  to  the  others.  The  reader  is  advised  to  break 
off  when  he  encounters  material  of  no  relevance  to  his  needs. 

4.1  Permanency  of  Storage 

Data  in  SPL  is  of  two  kinds:  permanent  and  stacked. 

Permanent  data  stays  around  for  the  life  of  a  program,  i.e. 
the  value  of  a  permanent  data  item,  once  set,  survives  until 
explicitly  changed  by  the  program.  All  data  declared  in 
COMMON  blocks  is  permanent.  Data  declared  in  PROGRAM 
blocks  is  permanent  if  the  block  includes  a 

fixed: statement  =  "FIXED"  [","  "ORIGIN"  expr]; 

The  function  of  the  ORIGIN  clause  is  explained  in  Section  4.3.  This 
statement,  if  it  is  present,  must  appear  between  the  i nclude :statement 
and  the  declare :statements  of  the  block.  The  FIXED 
program  block  may  not  be  entered  recursively  ^ny  funtion  calls) 
during  execution.  This  error  is  not  checked  for. 
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If  a  program  block  is  not  FIXED,  all  the  variables  local  to 
it  are  stacked.  This  means  that  their  values  are  undefined 
when  the  block  is  entered  (by  a  call  to  one  of  its  functions), 
may  become  defined  by  the  action  of  the  program  and  disappear 
when  the  function  returns.  The  block  may  be  entered  recursively, 
and  the  values  of  its  local  variables  for  each  level 
of  recursion  are  completely  distinct. 

4.2  Layout  of  Core 

The  arrangement  of  memory  relative  to  6'  (the  global  environment) 
is  designed  to  group  read-only  objects  together 
and  on  separate  pages  from  writeable  things,  so  that  the 
former  can  be  protected  by  the  hardware  from  modification. 

Later  improvements  will  permit  small  programs  to  be  packed 
together  better. 

Space  is  allocated  in  four  main  regions 

G':WGS-^  ^R«^GS:G'+4Q000B;CS-^  :377777B 

WGS:  Writeable  Global  Storage,  starting  at  G'.  This  area  is 
allocated  by  a  general  storage  allocator  in  the  compiler  in  a 
piecemeal  fashion:  no  attempt  is  made  to  keep  related  things  togethe 
All  the  writeable  variables  which  appear  in  common  blocks,  together 
with  fixed  local  environments  are  put  here. 

Some  of  the  first  128  words  may  also  be  used  for  field  and  array 
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descriptors,  at  the  discretion  of  the  compiler,  except  in 
the  monitor  ring,  where  this  will  never  be  done  (unless  forced 
by  equivalences).  The  first  few  words,  of  course,  are  used  for 
objects  whose  location  is  fixed  by  the  hardware,  like  the  stack 
descriptor.  The  allocatifin  strategy  for  this  area  may  be  modified  by 
ORIGIN  statements;  see  below. 

Collision  of  this  area  with  RSGS  is  a  fatal  error. 

The  stack  (where  stacked  data  is  stored)  is  allocated  space  at 
the  end  of  this  area.  Its  size  depends  on  the  number  of 
non-FIXED  PROGRAM  blocks  entered  but  not  exited. 

RSGS:  Read-only  Scalar  Global  Storage.  Here  are  put  the  constant 
scalars  (e.g.  array  descriptors  and  initialized  scalars) 
from  common  blocks,  as  well  as  function  descriptors.  This  area 
is  allocated  by  another  incarnation  of  the  general  storage  allocation 
used  for  WGS  and  on  the  same  piecemeal  basis. 

CS:  Code  Storage.  Space  here  is  allocated  by  block.  All  the 
code  and  constants  generated  by  one  program  block,  or  all  the 
non-scalar  constants  (strings,  arrays,  etc.)  generated  by 
one  common  block,  are  collected  together  and  allocated  contiguously 
in  that  region.  Transfer  vectors  also  appear  here.  If  block  A 
'ecedes  block  B  lexically  (in  the  source),  then  the 
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CS  for  A  will  precede  the  CS  for  B. 

4.3  Origins 

The  origin: statement  permits  (most  of  the)  storage  of  a  block 
to  be  allocated  at  a  fixed  place. 

origin: statement  =  "ORIGIN"  [expr]; 

The  expression,  whose  value  is  called  the  origin  of  the  block, 
must  evaluate  to  an  integer  at  compile-time.  The  statement 
must  appear  in  the  block  after  any  include:statement  and  before 
anything  else. 

If  the  block  is  a  program  block  or  a  common  block  with  no 
writeable  variables  declared,  the  origin  tells  where  to  start 
its  space  in  CS.  If  the  space  for  the  preceding  block  in  CS  extends 
past  the  specified  origin,  an  error  is  recorded  and  the  statement 
is  ignored.  This  implies  that  origined  blocks  must  appear 
in  order  of  increasing  origins.  Note  that  the  scalar  storage 
of  a  common  block  is  allocated  in  RSGS  and  is  not  affected  by 
origin: statements. 
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If  the  block  is  a  conwon  block  with  writeable  storage,  then  the 
origin  tells  when  to  start  this  storage.  Two  restrictions 
apply 

1)  The  block  must  have  no  requirements  for  CS. 

2)  All  blocks  with  origined  WGS  must  appear  before  any 
non-origined  blocks  which  require  WGS,  so  that  the 
space  taken  by  origined  blocks  can  be  properly 
removed  from  the  control  of  the  storage  allocator. 

All  the  WGS  for  an  origined  block  is  allocated  together.  A 
subsequent  block  may  omit  the  expr  from  its  origin rstatement, 
in  which  case  its  WGS  is  allocated  immediately  following  that 
of  the  preceding  block. 


4.4  Fixed  Environments 

The  location  of  a  fixed  local  environment  may  be  specified  by 
the  fixed rstatement,  thus: 

FIXED,  ORIGIN  expr; 

The  origin  clause  tells  where  to  put  the  environment.  The  programmer 
is  responsible  for  the  security  of  the  area  he  chooses, 
which  is  not  checked  by  the  compiler.  In  the  absence  of  the 
ORIGIN,  the  compiler  will  allocate  the  storage  in  WGS  at  its 
discretion. 
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4.5  Equivalence 

An  equivalence  can  be  used  to  fix  the  location  of  a  scalar  or 
an  array  descriptor  by  writing  an  integer-valued  expression 
for  the  object  of  the  equivalence.  Thus 

DECLARE  A  =  40B,  ARRAY  BC30]  =  41 B; 

allocates  A  at  40  and  the  descriptor  for  the  array  B  at  locations 
41  and  42  octal.  The  array  itself  is  allocated  according  to  the  default 
rules.  Restriction:  t.®  value  of  the  equivalence  must  be  in 
the  range  [6*  ,G'+37777B].  An  equivalence  overrides  all  other 
methods  of  storage  allocation.  If  a  variable  V  has  been  equivalenced 
to  a  constant,  or  is  declared  in  a  common  block,  then 
@V  is  a  constant  whose  value  is  the  address  assigned  to  V. 

4.6  Fixed  Fields 


Descriptors  for  part-word  fields  are  normally  allocated  in  the 
first  128  words  of  the  global  environment  by  the  compiler  if 
there  is  room.  This  allocation  can  be  suppressed  and  the 
field  allocated  in  the  function  or  common  block  like  any  othe^ 
constant  by  prefixing  FIXED  to  [SIGNED]  FIELD  in  the  declaration. 
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5.  Declarations 

5.1  Names 

A  name  is  a  sequence  of  not  more  than  16  characters  starting 
with  a  letter,  each  of  which  must  be  either  alphanumeric  or 
an  '  (apostrophe). 

5.2  Declaration  Statements 


A  decleration  consists  of  a  list  of  names  together  with 
specification  of  scope,  type  and  mode,  and  possibly 
of  initialization  and  equivalence.  Thus: 


dec1are:state- 

ment  =  "DECLARE"  declare:c1ause; list  / 

macrorstatement; 


declare; clause: 

list  =  declare;clause  $(","  declare :clause) ; 


declare:c1ause  =  [type]  [mode]  item; 


item  =  identifier  [form  /  [dimension]  /  [length]] 

[equivalence]  [ini  tialization] ; 

equivalence  =  "="  (identifier  [subscript:list]/ 

("L"'  /  "G"')  subscriptrlis^"  /  expression); 
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subscript:1ist  =  "["  expression  expression) 

initialization  =  (expression  /  "("  expression 

expression)  ")"  ); 

Where  L'  means  local  environment  and  G'  means  the  global  environment. 

A  declaration  is  processed  from  left  to  right.  The  attributes 
are  initialized  as  follows: 


scope 

is  determined  by  whether  the  declaration  is  in  a 

PROGRAM  block  (LOCAL)  or  a  COMMON  block  (COMMON) 

type 

- 

INTEGER 

mode 

SCALAR 

The  values  of  the  three  attributes  are  called  the  state  of 
the  declaration.  Occurrence  of  attribute  specifiers  may  change 
the  state.  A  name  is  given  the  attributes  which  are  in  the 
state  when  it  is  encountered,  except  that  the  form,  dimension, 
or  length,  if  appropriate,  may  follow  the  name  as  indicated 
in  the  syntax  of  item.  FUNCTION,  FIELD,  and  ARRAY  are  taken 
as  specifying  mode  unless  immediately  followed  by  a  mode  word, 
in  which  case  they  specify  type.  Occurence  of  a  type  word 
sets  the  mode  to  SCALAR;  occurence  of  a  mode  word  leaves  the 
type  unchanged.  UNKNOWN  SCALARs  are  not  allowed. 
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EXAMPLE: 

DECLARE  INTEGER  A,  B.  STRING  C,  ARRAY  D,  E  [5], 
ARRAY  [10]  F,  G  (5:12); 

declares  integer  scalars  A  and  B,  string  scalar  C,  string 
arrays  D,  E,  F,  and  G.  The  array  D  is  not  assigned  any 
storage,  but  E  is  assigned  5  elements  and  F  and  G  get  10. 
Except  for  G,  no  space  is  assigned  for  the  string  values  and 
all  the  strings  have  8-bit  bytes;  each  element  of  G  is 
assigned  space  for  5  bytes  at  12  bits  each.  All  these  things 
are  local. 
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5.3  Attributes 

Every  name  has  three  attributes:  scope,  type,  and  mode.  Each 
is  chosen  from  a  fixed  set  of  alternatives: 

scope  =  "COMMON"  /  "LOCAL"  /  "GLOBAL"  ; 

=  ntype  /  ptype  /  "STRING"  [length]  ; 

Ptype  =  "PARAMETER"  integer  /  "PARAMETER  REAL"  / 
"PARAMETER  DOUBLE"  /  "PARAMETER  COMPLEX"  ; 

•^type  =  integer  /  "REAL"  /  "DOUBLE"  /  "COMPLEX"  / 

"LABEL"  /  "LONG"  /  "LONGLONG"  / 

"FUNCTION"  /  "FIELD"  /  "ARRAY"  / 

"UNKNOWN"  ; 

integer  =  "INTEGER"  /  "OCTAL"  /  "CHARACTER"  / 

"POINTER"  ; 

=  "FUNCTION"  /  ["FIXED"]  ["SIGNED"]  "FIELD"  [form]  / 
("ARRAY"  /  "ARRAYONE")  [dimension]/ 

"SCALAR"  ; 

Note  that  if  the  type  is  not  specified  "INTEGER"  is  assumed. 
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Certain  constructions  permitted  by  the  above  syntax  are  forbidden 
because  no  reasonable  meanings  can  be  attached  to  them. 

1)  Objects  of  type  ARRAY  or  STRING  with  mode  FUNCTION 
or  FIELD  do  not  need  dimensions  and  lengths,  and 
to  give  them  as  part  of  the  item  is  an  error. 

2)  A  form  may  appear  only  if  the  mode  is  FIELD,  a 
dimension  only  if  the  mode  is  ARRAY,  a  length 
only  if  the  type  is  STRING. 
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The  type  of  scalar  value  de';ermines  its  size:  integer, 

function  and  field  are  one  word  values;  long,  real,  array,  and 

label,  are  two  words  each;  string,  double,  longlong,  and  complex, 

are  four  words  each.  An  array  is  represented  by  a  two-word  descriptor, 

as  is  a  label  scalar.  A  function  scalar  is  represented  by  a  pointer 

to  the  two -word  descriptor.  A  field  scalar  is  either  a 

constant,  if  its  form  is  specified,  or  occupies  a  single 

word.  The  four  cases  of  integer  are  included  to  permit 

intelligent  printout  of  the  value  during  debugging.  The 

compiler  recognizes  only  one  type  of  integer,  and  the  others  will 

not  be  mentioned  again. 

If  a  name  has  mode  ARRAY  (or  ARRAYONE;  they  are  identical 

except  that  the  latter  causes  subscripts  to  start  at  1 

rather  than  0),  subscripted  references  to  it  will  be  compiled 

on  the  assumption  that  indirection  through  the  descriptor 

will  produce  the  effective  address.  It  is  also  possible  to  subscript 

INTEGER  SCALARS;  such  references  will  add  the  value  of  the  name  to 

the  subscript  to  produce  the  effective  address. 

If  a  name  is  a  field  without  a  form,  tailing  "$"  or 

will  cause  indirection  through  the  location  allocated 
to  it.  If  it  has  a  form,  it  is  treated  as  a  constant  and 
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and  the  code  compiled  depends  on  whether  it  is  full-word  or 
part-word.  If  a  field  appears  without  tailing,  it  is 
treated  as  an  integer  whose  value  is  the  field  descriptor 
if  the  field  had  a  form,  and  the  contents  of  the  location 
allocated  to  it  otherwise.  If  a  field  is  SIGNED,  the  top 
bit  will  be  copied  into  all  the  higher  bit  positions  of  a 
24-bit  word  when  the  field  is  used  to  fetch  a  datum.  Other¬ 
wise,  these  bit  positions  (if  any)  will  be  filled  with  zeros. 

5.4  Attribute  Modifiers 


The  shapes  and  sizes  of  arrays  and  strings  are  specified  by  modifiers 

(dimensions,  lengths  and  forms)  which  have  already  appeared 

in  the  syntax  for  attribute  names.  Throughout,  the  expressions 

must  evaluate  to  constants  at  compile  time.  This 

means  that  all  the  operands  must  be  constant.  See  "Constants" 

in  Section  5.7  for  a  discussion  of  what  operands  are  regarded  as 

constant. 

5.4.1  Dimensions 


dimension  =  "["  expr  $(","  expr) 

[":"  [expr]  [","  expr]]  "]"; 

Arrays  of  any  dimensionality  up  to  7  are  allowed.  The  first 
expression  following  the  colon  specifies  the  number  of  words 
allocated  to  each  element  of  the  array;  this  makes  it  easy 
to  create  tables  with  multi-word  entries.  The  size  of  an 
element  is  limited  to  64  words.  If  it  is  not  specified,  it 
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is  taken  to  be  the  size  of  the  scalar  object  with  the  same 
attributes  as  the  declared  array.  If  an  array  is  given  an 
element  size  different  from  the  one  implied  by  its  type, 
then  subscripting  it  yields  an  expression  of  type  UNKNOWN. 

See  "Expression"  (Section  6.1)  for  the  implications  of  this. 

The  second  expression  following  the  colon  tells  where  to 
allocate  the  first  word  of  the  array.  If  it  is  absent, 
the  array  is  allocated  using  standard  policies  described 
in  Section  4  "Allocation." 

5.4.2  Length 

length  =  "("  expr  [":"  [expr]  [","  expr]]  ")"  ; 

The  string  length  is  specified  in  bytes  by  the  first  expression. 

The  second  expression  gives  the  byte  size,  chosen  from 

6,  8,  12,  and  24;  8  is  the  default  value.  The  third  expression 

tells  where  to  allocate  the  first  word  of  the  string. 

If  an  array  or  string  lacks  dimension  or  length,  no  space 
is  allocated  and  no  descriptor  created  by  the  compiler.  In 
this  case  an  array  is  assumed  to  take  one  subscript. 

If  these  elements  are  present,  space  is  assigned  to  the  local 
environment  if  the  scope  is  LOCAL  and  the  descriptors  are 
initialized  at  function  entry.  If  the  scope  is  COMMON , 
space  is  assigned  in  the  proper  common  block  and  descriptors 
compiled  into  this  block.  See  Section  4  on  "A1  loc^ition"  for  details. 
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5.4.3  Form 

form  =  "("  word idisplacement  [ 

starting: bit  ending: bit] 

. 

word:displace- 

ment  =  expr; 


starting: bit  =  expr; 


ending:bit  =  expr; 


A  form  specifies  the  word  displacement  and  left-  and  right¬ 
most  bits  of  a  field.  If  the  bit  numbers  are  omitted,  0  and 
23  are  used.  A  field  may  not  cross  a  word  boundary. 

5.5  Equivalence 

An  equivalence  has  the  following  meaning:  the  identifier 
following  the  called  the  object,  must  be  previously  declared; 
if  subscripts  appear,  it  must  be  a  dimensioned  array 
and  the  number  of  subscripts  must  match  the  number  of  dimensions. 
The  effect  is  to  assign  the  same  storage  to  the  identifier 
preceding  the  called  the  subject,  as  has  already 
been  assigned  to  the  object.  If  the  identifier  in  the  object  is 
L'  or  G',  the  subject  is  assigned  to  the  designated  location  in 
the  local  or  global  environment  respectively.  If  the  subject  is 
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a  dimensioned  array  or  a  string,  its  descriptor  is  assigned  to 
the  same  location  as  the  object  (to  allocate  the  storage  for 
array  or  string  values,  see  above);  otherwise  the  subject 
itself  is  assigned  to  the  same  location  as  the  object.  No 
account  is  taken  of  the  possibility  that  the  subject  may 
occupy  more  space  than  has  been  allocated  for  the  object. 

For  some  details  and  restrictions,  see  "Allocation"  (Section  4). 

5.6  Initialization 

Initialization  of  SCALARs  has  the  following  meaning:  if  no 
equivalence  is  present,  the  identifier  being  declared  becomes 
synonymous  with  the  initialization  quantity.  For  INTEGERS 
which  lie  in  [-2000B,  1777B],  no  space  is  allocated;  for  all 
other  types,  and  for  INTEGERS  outside  this  range,  space  is 
allocated  to  hold  the  constant  v^.lue  in  RSGS  (for  COMMON 
blocks)  or  CS  (for  PROGRAMS).  If  an  equivalence  appears,  the 
object  must  be  an  absolute  location  (see  "Allocation"),  a 
scalar  or  array  element  with  scope  =  COMMON,  or  an  element 
of  an  initialized  local  array,  and  the  initialization 
quantity  will  be  stored  into  the  variable,  wherever  it  may  be. 

For  each  type  of  SCALAR,  the  initialization  quantity  must  be 
a  constant  of  that  type.  A  LONG  or  a  LONGLONG  may  be 
initialized  with  a  string  constant  or  with  a  list  of  integers; 
this  is  the  only  way  of  introducing  constants  of  these  types 
into  an  SPL  program.  An  initialized  STRING  must  not  have 
a  length. 


31 


Numeric  initialized  variables,  if  not  equivalenced,  may  be 
re- initialized.  This  is  primarily  useful  for  things  like 
defining  fields,  etc.,  using  a  compile-time  counter.  If 
block  A  includes  block  B,  re-initialization  by  a  declaration 
in  A  of  a  variable  acquired  from  B  has  no  effect  on  B 
or  any  other  block  that  includes  B. 

Initialization  of  FUNCTIONS  is  done  with  a  single  name; 
otherwise  the  comments  above  apply.  Initialization  of 
FIELDS  is  illegal;  the  way  to  do  this  is  to  specify  the 
form  explicitly. 

An  ARRAY  is  initialized  with  a  list  of  constants  of  the 
appropriate  type.  (Elements  of  the  list  are  separated  by 
commas  and  the  list  is  enclosed  in  parenther^s ,  as  usual.) 

A  FIELD  ARRAY  may  be  initialized  with  constant  FIELD  SCALARs; 
an  ARRAY  ARRAY  may  be  initialized  with  names  of  arrays  which 
have  been  declared  with  dimensions.  The  elements  of  the  list 
go  into  successive  elements  of  the  array,  starting  with  the 
first  one.  For  mul ti -dimensional  arrays,  the  last  subscript 
varies  most  rapidly,  just  as  the  array  is  actually  stored. 
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A  special  feature  allows  initialization,  at  a  later  time, 
of  further  elements  of  an  array  some  of  whose  elements 
have  already  been  initialized.  If  X  is  a  declared, 
initialized  array,  then  the  appearance  of 

X  subscript:list  initialization 

as  a  declare:clause  will  cause  the  expression(s)  in  the 
initialization  to  be  stored  into  elements  of  the  array 
starting  at  the  one  designated  by  the  subscriptilist.  Of 
course,  all  the  subscripts  must  be  constant. 

5.7  Constants 


For  each  type  there  is  a  syntax  for  constants  representing 
values  of  this  type. 

5.7.1  Integer  Constants 

integerrconstant  =  simple:integer  /  character:constant; 

simple:integer  =  digit  $(digit)  ["B"  [digit]]; 

If  B  appears,  it  causes  the  digits  to  be  interpreted  as  octal; 
otherwise  they  are  taken  to  be  decimal.  If  a  digit  n  follows 
the  B,  it  is  a  scale  factor,  i.e.,  it  is  equivalent  to  n  zeros 
preceding  the  B. 


character: constant  =  {"6"'  $4(pseudo:character)/ 

("8"'  /  $3  (pseudo;character)) 

II  I  II  • 

» 

pseudo:character  =  <character  other  than  &  or  '>/ 

letter/  "&&"  / 

3$3  digit; 

A  character  constant  allows  up  to  three  8-bit  or  four  6-bit 
characters  to  be  right-justified  in  a  24-bit  word  to  make  an 
integer.  Pseudo: characters  permit  quotes  and  control  characters 
to  appear;  the  latter  are  specified  by  the  letter  whose 
code  is  less  by  100B. 

5.7.2  Real  Constants 


real :constant 

simple:real :constant 

exponent 

sign 

digits 


simple:real :constant  [exponent]  / 
digits  exponent; 

digits  $(digit)  /  digits; 
"E"  sign  digi ts  ; 

["+"  / 
l$(digit) ; 


The  meaning  of  this  should  be  obvious;  the  given  decimal 
approximation  to  a  real  number  is  converted  to  the  closest 
approximation  possible  in  the  machine's  48-bit  binary  representation. 


5.7.3  Double  Constants 


doublerconstant  =  (simp1e:rea1 :constant  /  digits)  "D" 

["+"  /  digits; 

In  this  case  the  machine's  96-bit  binary  representation  is 
used.  Note  that  D  must  appear  in  a  double  constant,  and 
that  either  .  or  E  must  appear  in  a  real  constant. 

5.7.4  Imaginary  Constants 

imaginary:constant  =  (real. -constant  /  digits)  "I"; 

Complex  constants  may  be  constructed  by  arithmetic  on  real  .-constants 
and  imaginary : cons tants ;  such  arithmetic  is  performed  at  compile  time 
resulting  in  a  single  complex  constant  in  the  object  code. 


5.7.5  String  Constants 

string:constant  =  ('6'"  /  '8'"  /  "")  $(pseudo:character)  ; 

The  value  is  a  string  with  the  specified  sequence  of  characters 
encoded  in  8-bit  (default  case)  or  6-bit  bytes  as  specified. 
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5.7.6  Label  Constants 


label : constant  =  identifier; 

The  identifier  must  not  appear  in  a  DECLARE  statement;  it  must 
appear  as  a  label  exactly  once  in  the  function,  i.e.,  at  the 
beginning  of  a  statement  and  followed  by  a  colon. 

5.7.7  Constant  Expressions 

The  compiler  will  evaluate  any  expression  consisting  entirely 
of  constants  and  standard  functions  and  thus  will  treat  it 
like  a  single  constant. 

5.8  Data  Formats 


The  formats  of  the  various  kinds  of  values  (i.e.  the  binary 
representations)  are  in  great  part  determined  by  the  hardware 
of  the  machine.  We  summarize  them  here  for  completeness, 
and  to  specify  a  few  conventions  established  by  SPL.  Refer 
to  the  CPU  manual  for  the  exact  word  layouts. 

Integers  are  24-bit  two's  complement. 

Longs  are  48-bit  quantities.  No  operations  are  defined 
on  them  except  general  ones  for  moving  and  decomposing 
any  data  object. 

Longlongs  are  96-bit,  but  otherwise  identical  to  longs. 

Reals  are  48-bit:  sign,  11-bit  exponent  and  36-bit  fracti 
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Double  precision  real  numbers  are  96-bit;  the  format  is 
identical  to  that  for  reals,  except  that  the  fraction 
is  84-bit. 

Complex  numbers  are  96-bit  and  consist  of  two  reals. 

The  real  part  is  the  first,  the  imaginary,  the  second. 
Strings  are  four-word  (96-bit)  objects  called  descriptors. 
Each  word  is  a  hardware  string  indirect  address  word: 


Bits  Function 

0-1  =2,  to  specify  a  string  descriptor 

2-3  byte  size:  0=6-bit,  l=8-bit,  2=12-bit, 
3^24-bit 

4-5  byte  number  in  word,  counting  from  left 

6-23  word  address 


The  four  words 
0: 

1: 

2: 

3: 


are  used  as  follows: 

start  of  string,  points  to  byte  before  first  byte  of  string 

reader  pointer,  points  to  last  byte  read 

writer  pointer,  points  to  last  byte  written 

end  of  string,  points  to  last  byte  available  in  string 


Labels  use  the  hardware's  BLL  descriptor,  which  is  too 
complex  to  be  described  here.  Functions  are  represented  by 
pointers  to  their  BLL  descriptors. 
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Fields  use  the 

hardware's  field  descriptor,  which  is  a  one  word 

object  with  the 

following  form: 

0-1 

=1,  to  specify  a  field  descriptor 

2 

set  for  SIGNED  field 

3-7 

length  in  bits 

8-12 

bit  address  of  first  bi,t 

13-23 

word  displacement 

Arrays  use  the 

hardware's  array  descriptor,  which  is  a  two-word 

object  with  the 

following  form: 

0  :  0-1 

=3,  to  specify  an  array  descriptor 

0  :  2 

lower  bound  (0  or  1)  on  subscript 

0  :  3 

set  for  marginal  index  descriptors  (see  below) 

0  :  4 

large  element  bit 

0  :  5-6  or 

5-10 

multiplier  (element  size) 

0  :  7-23  or 

n-23 

upper  bound  on  subscript 

1  :  6-23 

address  of  first  word  of  array 

Array  of  dimension  >1  are  handled  by  marginal  indexing; 

see  the  discussion  of  arrays  in  Section  6.1.5. 

Intrinsic  functions  will  exist  to  decompose  and  construct 
all  these  descriptors. 
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5.9  Function  Declarations 

The  syntax  is 

function:statement  =  ftype  ("FUNCTION"/  "ENTRY") 

identifier  "("  [dec1are:clause: 
list]  ")"  [","  "FRETURN"]  [function: 
location]; 

ftype  =  ntype  /  "STRING"; 

function:location  =  ("MONITOR"  /  "UTILITY"  /  "POP"  / 

"TRAP 'ENTRY"  /  "FTRAP' ENTRY"  / 

"SP' ENTRY"  /  "SYSPOP")  ["^"  expression]; 

For  example 

FUNCTION  F  (I.  REAL  J,  STRING  ARRAY  K) ; 

ENTRY  and  FUNCTION  are  synonyms. 

5.9.1  Formal  Parameters 

The  dec1are:clause: list  must  not  include  lengths  or  forms. 

It  may  include  dimensions,  but  only  the  number  of  subscripts 
is  counted,  not  the  values,  and  the  subscripts  may  be  null 
(e.g.  A[,]  for  a  matrix).  Arrays  are  assumed  to  have  one 
subscript  if  no  dimension  appears. 

Any  identifiers  in  the  declare ;clause; list  which  have  not 
already  been  declared  are  declared  as  though  they  had  appeared 
in  a  DECLARE  statement  with  the  same  attrioutes.  If  any  such 
identifier  has  already  been  used,  an  error  comment  results. 

For  each  identifier  which  has  already  been  declared  either: 
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1)  the  attributes  specified  for  it  in  the  function 
declaration  must  exactly  agree  with  the  attributes 
already  declared  for  it,  or 

2)  no  attribute  specifiers  may  precede  it  in  the 
declare:clause:list. 

Otherwise  there  will  be  an  error  comment. 

The  identifiers  in  the  declare rclause: list  constitute  the 
formal  arguments  in  the  order  in  which  they  are  written. 
When  the  function  is  called  (see  "function  calls"  below) 
an  equal  number  of  actual  parameters  must  be  supplied,  and 
they  must  agree  in  type  and  mode.  No  automatic  conversions 
are  done.  The  agreement  is  checked  when  the  call  occurs. 

5.9.2  FRETURNs 

The  FRETURN  clause  must  be  included  if  the  function  returns 
with  FRETURN.  In  this  case  it  must  always  be  called  with 
a  failure  clause.  If  any  function  in  a  program  block  has  a 
FRETURN,  the  first  one  must. 

5.9.3  Special  Entry  Points 

The  function:location  specifies  that  the  function  is  to  be 
entered  in  one  of  the  system-defined  transfer  vectors  at 
the  location  specified  by  the  expression.  In  the  case  of 
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POP.  SPL  will  supply  a  location  if  none  is  specified.  The 
possibilities  are: 

POP  the  function  is  to  be  callable  as  a  POP 

TRA’’’ ENTRY  the  function  is  to  ^e  called  when  the 

specified  (ring-dependent)  hardware 
trap  occurs. 

SP' ENTRY  the  function  is  to  be  called  when  the 
sub-process  in  which  it  runs  is  entered 
at  the  specified  entry  point. 

The  remaining  ones  are  of  interest  only  to  system  programmers 

FTRAP' ENTRY  the  function  is  to  be  called  when  the 
specified  (fixed)  trap  occurs. 

MONITOR  the  function  is  to  be  called  when  the 
specified  MCALL  is  executed. 

These  two  make  sense  only  if  the  function 
is  in  the  monitor. 

UTILITY  tht  function  is  to  be  called  when  .-.e 

specified  UCALL  is  executed.  This  makes 
sense  only  if  the  function  is  in  a  utility. 
If  any  function  in  a  program  block  has  a 
MONITOR  or  UTILITY  functionilocation,  the 
first  one  must  have  it. 


SYSPOP 


the  function  is  to  be  called  when  the 


specified  syspop  is  executed. 


The  following  tables  summarize  the  treatment  of  the  various 
special  kinds  of  entry  points. 


Type  of  function 

Call  with 

Return  with  Put  descriptor 

Ordinary 

BLL 

BLL 

-- 

MONITOR 

MCALL 

BLL 

MCALL  Transfer  Vector 

UTILITY 

UCALL 

BLL 

UCALL  TV 

POP 

Pop 

BLL 

POP  TV 

TRAP' ENTRY 

Not  applicable. 

A  trap 'entry  is  not  really 

a  function.  It  does  not  have  argumen';s. 
The  address  of  the  first  word  of  code 


should  be  put  into  the  TRAP  TV.  It  is  a 
programming  error  to  reference  any  local 
variables  or  do  a  return. 

FTRAP' ENTRY  As  for  TRAP 'ENTRY,  but  put  the  address  of 

the  first  word  into  the  FTRAP  TV. 

SYSPOP  As  for  TRAP 'ENTRY,  but  put  the  address  of 

the  first  word  of  code  into  the  TRAP  TV 
at  20B  +  syspop  number. 

SP' ENTRY  SP'CALL  SP' RETURN  SP  TV 

A  sp'entry  is  not  really  j  function.  It  does  not 
have  arguments.  It  is  a  programming  error  to  reference 
any  local  variables.  The  only  proper  way  to  call 
an  sp'entry  is  with  an  sp'call  and  to  return  with 
an  sp' return. 


Table  5.1  Summary  of  Function  Call  Conventions 
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Location  and  contents 


Name  of  TV 

of  descriptor 

Contents  of  TV  entry 

MCALL 

604000B;  UB=MAXMCALL 

Absolute  address  of 
function  descriptor. 
Initialized  to  an 
error  function. 

UCALL 

4030 14B;  UB=MAXUCALL 

As  for  MCALL. 

POP 

G'[0];  UB=MAXP0P 

As  for  MCALL. 

TRAP 

G' [6] ;  UB-12B  except 
for  user  ring,  where 
LIB=20B+MAXSYSP0P 

Absolute  address  of 
code.  Initialized 
to  a  trap  routine. 

FTRAP 

604002B;  UB=14B 

As  for  TRAP 

SP 

G'[12B];  UB=MAXSP 

As  for  MCALL 

All  descriptors  are  normal  lAWs  with  indirect  addressing;  they 

point  to  ARRAY  lAWs  with  LB=0  (LB=1  for  TRAPS),  MULT=1 ,  BASE=indexed  indirect 

source-relative  pointer  to  the  transfer  vector,  which  is 

allocated  in  code  space  at  the  discretion  of  the  compiler. 


The  MAX  symbols  are,  for  the  moment,  built  into  the  compiler 
with  the  following  values: 

MCALL  =  400B,  UCALL  =  400B ,  POP  =  lOOB,  SYSPOP  =  lOOB,  SP  =  20B. 


Table  5.2:  Transfer  Vectors 
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5.10  Macros 

The  language  allows  a  simple  form  of  token-substitution 
macro.  A  macro  is  defined  by  a 

macro: statement  =  "MACRO"  macrorname  ["("  formal :1ist  ")"] 

"■h"  macro  .‘body; 

macro: name  =  identifier  ; 

formal; list  =  [formal  formal)]  ; 

formal  =  identifier  ; 

macro:  body  =  compact  .‘token:  string; 

compact: token: 

string  =  <arbitrary  string  of  tokens  not  including  ";"> 


Once  a  mac'^o.'name  has  been  defined  (i.e.  has  appeared  in  a 
macro: statement)  it  can  only  be  used  in  a  macro:call.  A 
macro: call  may  appear  anywhere  except  in  a  string  or  character 
constant.  It  is 

macro:call  =  macro:name  ["{"  actual :list  ")  "]  ; 

actua1:1ist  =  [actual$(","  actual)] 

actual  =  ba1anced:token:string 

balanced '.token: 

string  =  <compact:token;string  balanced  with 

respect  to  parentheses,  and  not  including 
except  in  parentheses,  or  carriage  return> 


The  actual: list  must  be  present  if  and  only  if  the  formal: list 
was  present  in  the  macro: statement,  and  must  be  of  the  same 
length  as  the  formal: list.  The  macro: call  is  replaced  by  the 
macro  .‘body,  except  that  each  occurrence  of  a  formal  in  the 
macro:body  is  replaced  by  the  corresponding  actual.  The 
result  is  then  scanned  again  for  further  macros. 

Macros  in  a  macro:body  are  expanded  at  definition  time  (unless 
they  have  not  yet  been  defined,  in  which  case  they  are  expanded 
at  call  time  according  to  the  rescanning  rule  stated  above). 

If  expanded  at  call  time,  their  actuals  must  not  include 
any  formal s. 

Note  that  a  macro  is  expanded  strictly  by  token  substitution: 
there  is  no  requirement  that  any  of  the  token  strings  involved 
make  syntactic  or  semantic  sense. 
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6.  Action; Statements 


An  action: statement  is  defined  by: 
action:statement  =  expression/ 

if  :stat''ment/elseif:statement/endif  :statement/ 
f  or  :statement/endf  or  .‘Statement/ 
assembler:statement/ 

6. 1  Expressions 

This  section  provides  the  following  information  about  SPL 
expressions: 

approximate  syntax,  based  on  the  precedence  of  the 
operators 

exact  syntax 

rules  for  types  of  operands 

the  semantics  of  the  various  operators 


6.1.1  Precedence  of  Operators 


Expressions  are  made 
in  order  from  low  to 
FOR  WHILE 
IF  ELSE 
WHERE 
& 

RETURN  FRETURN 

OR 

AND 

NOT  (unary) 
=#>>=<<= 
(on  right) 

MOD 

+  -  V  E' 

*  /  LSH  RSH  LCY 
RCY  A' 

** 

+  -  N'  (unary) 
GOTO 

■*-  (on  left) 

.  $  @ 

$  @  (unary) 

[]  0 


up  of  operators  and  operands .  The  operators , 
high  precedence,  are: 
loops 

conditionals 

sequential  evaluation 

sequential  evaluation 

function  returns 

boolean  "or" 

boolean  "and" 

boolean  "not" 

relations 

assignment 

modulo  or  remainder 

add,  subtract,  logical  or,  logical 

exclusive  or 

multiply,  divide,  left  shift,  right  shift, 

left  cycle,  right  cycle,  logical  and 

exponentiate 

unary  +  - ,  logi cal  not 

transfer 

assignment 

field  operations 

indirection,  reference 

subscripting,  function  call 
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The  operands  are: 

constants 

names 

parenthesised  expressions 


The  section  6.1.1  Operators  by  Precedence  List,  while  convenient 
for  quick  reference,  does  not  suffice  to  specify  the  syntax  of 
expressions.  We  therefore  state  the  complete  syntax;  explanations 
of  the  meaning  of  the  operators  follow  in  section  6.1.3: 


expression  =  forexp; 

forexp  =  ifexp  $(  "FOR"  forclause  /  "WHILE"  ifexp); 

forclause  =  identifier  remainder  (["," 

alternation]  "WHILE"  ifexp  /  ["BY" 
ifexp]  ["TO"  ifexp])  ; 

ifexp  =  whrexp  ["IF"  whrexp  ["ELSE"  ifexp]]; 

whrexp  =  catexp  ["WHERE"  whrexp]; 

catexp  =  retexp  $("&"  retexp), 

retexp  =  alternation  /  ("RETURN"  /  "FRETURN") 

(alternation  /  "("  ifexp  $(","  ifexp) 

")"). 

alternation  =  conjunction  $("0R"  conjunction)  / 


"GOTO"  tailing; 

conjunction  =  negation  $( "AND"  negation); 

negation  =  ["NOT"]  relation; 

relation  =  assignment  [("="  /  /  ">" 


"<"  /  "<=")  assignment]; 


/  ">="  / 


48 


remainder 

= 

sum  ${  "MOD"  sum); 

assignment 

= 

remainder  /  a: tailing  assignment; 

sum 

= 

term  $(("+"  /  /  "V"  /  "E"')  term); 

term 

= 

factor  $(("*"  /  /  "LSH"  /  "RSH"  / 

"LCY"  /  "RCY"  /  "A’")  factor); 

factor 

= 

[••+••  /  /  "N"']  power; 

power 

= 

tailingC  "**"  factor]; 

tailing 

= 

a: tailing  /  v; tailing  ; 

a:tai ling 

= 

indirection  $(tail)  /  reference 

$("."  field)  ; 

v:tai ling 

= 

reference  $(tail)  ; 

tail 

= 

(MM  !  M|M  !  M@M)  . 

indirection 

s 

1 $("$")  arrayref  ; 

reference 

r 

["(3"]  arrayref  /  function: call  ; 

array  ref 

= 

a: primary  $(  "["  expression 

expression)  "]"  )  ; 

function: call 

= 

a: primary  /  v; primary  ; 

a  .'primary 

= 

identifier  /  "("  a: tailing  ")"  ; 

v: primary 

= 

constant  /  "("  expression  ")"  ; 

Note:  this  grammar  is  ambiguous  because  a  function.'call  can  be  parsed  as 
both  a:primary  and  v:primary.  a:primary  parsing  will  be  used  if  possible. 
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The  various  operations  have  various  requirements  for  the 
types  of  operands  permitted  and  the  type  of  result  produced. 

The  permitted  combinations  are  summarized  in  Table  6.1  ,  in 
which  the  following  conventions  are  used, 
type  abbreviations:  I  integer 

6  long  or  longlong 
R  real 
D  double 
C  complex 
S  string 
L  label 
U  unknown 

other  abbreviations:  F  suffix  means  mode  =  FIELD 

T  suffix  means  mode  =  FUNCTION 
Y  suffix  means  mode  =  ARRAY 
S  suffix  means  mode  =  SCALAR 
A  means  any  type 

N  means  I,  R,  D  or  C  (i.e.  number) 

M  means  I ,  R  or  D 

Where  A,  N  or  M  is  suffixed  with  a  digit,  different  digits  indicate 
that  different  types  may  appear.  See,  for  example,  "WHERE": 
argument  1  and  argument  2  may  be  different  types.  If  the  digits  are 
the  same,  or  there  is  no  digit,  the  types  must  be  the  same. 

Note:  A  partial  ordering  on  the  numeric  types  is  defined:  I<R<D, 

R<C.  Where  two  Ns  or  Ms  appear,  the  lower  is  converted  to  the  higher 
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before  the  operation  is  evaluated.  If  the  result  is  N  or  M, 
it  has  the  higher  type  also.  See,  for  example,  If  NSl 

were  "I"  (integer)  and  NS2  were  "R"  (real),  then  NSl  would  be 
converted  to  "R"  and  the  result  would  be  "R".  It  is  illegal  to  have 
one  D  argument  and  one  C  argument.  Where  A  appears,  the  mode 
is  free  except  as  fixed  by  suffixes.  In  all  other  cases  mode 
=  SCALAR. 

Constants  receive  special  treatment.  Any  type  N  constant  is 
automatically  converted  to  a  higher  type  if  that  is  required 
for  an  assignment  to  be  legal.  This  is  not  done  for  variables; 
the  explicit  transfer  functions  described  in  the  definition  of 
action: statement  at  the  start  of  Section  6  must  be  used. 

An  object  of  type  U  may  be  used  where  A  appears  in  Table  6.1. 

It  may  also  be  used  as  one  of  the  operands  in 

the  lines  marked  *,  in  which  case  it  is  assumed  to  have  the 

type  of  the  other. 

Note  the  treatment  of  ARRAYS,  FIELDS  and  FUNCTIONS  of  type 
ARRAY,  FIELD  or  FUNCTION.  When  such  variables  are  applied 
to  subscripts,  pointers  or  function  arguments,  they  yield  results 
of  type  UNKNOWN  and  mode  given  by  their  type.  Normally 
such  results  must  be  assigned  to  something  of  known  type 
before  they  can  be  used  because  of  the  restrictions  on  the  use 
of  type  UNKNOWN;  thus,  for  example,  if  we  want  A  to  be  an 
ARRAY  of  REAL  FUNCTIONS  we  would  write 
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DECLARE  FUNCTION  ARRAY  A,  REAL  FUNCTION  RA 
• 

RA  ^  A[I]; 

RA(X,Y  +  5); 
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ARGl 

OPT 

ARG2 

RESULT 

NOTES 

A 

IF 

I  ELSE  A 

A 

The  A's  are  required 
to  be  the  same  only 

AT 

WHERE 

A2 

A1 

if  the  value  of  the 

IF  is  used. 

A1 

& 

A2 

A2 

*IS 

OR 

IS 

IS 

*IS 

AND 

IS 

IS 

- 

NOT 

IS 

IS 

*NS1,AS  =,^ 

NS2,AS 

IS 

*MS1 

<,<=,>,>= 

MS2 

IS 

*A 

4- 

A 

A 

*MS1 

MOD 

MS2 

MS 

*NS1 

+  -  *  / 

^  i  1  if 

NS2 

NS 

*NS1 

icit 

NS2 

NS 

but  see  details  below 

*IS 

SHIFT, 

IS 

IS 

A'  ,E' ,V' 

- 

N' 

IS 

IS 

- 

+  ,- 

NS 

NS 

- 

GOTO 

LS 

- 

IS 

• 

AF 

I\S** 

AS 

$ 

IF 

IS 

IS 

@ 

IF 

IS 

* 

- 

$ 

IS 

US 

- 

A 

IS 

■ 

AY 

[IS..., 

IS] 

AS** 

IS 

[  IS  ] 

US 

AT 

(A2.  ..., 

An) 

AS** 

*:  one  operand  may  be  U, 

and  is  assumed  to 

have  the 

type  of 

the  other.  — 

**:  if 

A  is  ARRAY,  FIELD 

or 

FUNCTION,  the  result  is 

type  U, 

mode  A. 

Table  6.1 

Permitted  Operands 

and  Type  of  Result 
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6.1.4  Semantics  of  Expressions 


We  now  complete  the  discussion  of  operations  with  comments 
on  the  evaluation  of  each  one,  together  with  some  rema-ks 
which  may  clarify  the  syntax  and  type  conversion  rules  given 
above.  The  operands  are  referenced  by  the  symbols  which  stand 
for  them  in  the  expression  schemata  on  the  left. 


FOR, WHILE  are  discussed  in  Section  6.2.2 

A1  IF  I  ELSE  A2  evaluates  I.  If  I  ^  0,  evaluates 

A1  and  returns  its  value,  otherwise 
evaluates  A2  and  returns  its  value, 
or  returns  0  if  the  ELSE  is  missing. 

Typical  usage  is: 

F(X)  IF  X  <  4  ELSE  G(X)  IF  X<  5  ELSE  H(X)j 


Note  that: 

X  ^  Y  IF  Y  <  3  ELSE  Y+1  ; 

alters  X  only  if  Y  <  3.  Therefore 
wri te: 

X  ^  (Y  IF  X  <  3  ELSE  Y+1)  ; 
if  this  is  intended. 

A1  WHERE  A2  evaluates  A2,  then  evaluates  A1 

and  returns  its  value. 
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A1  &  A2 

RETURN,  FRETURN 
n  OR  12 


evaluates  A1 ,  then  evaluates  A2 
and  returns  its  value.  Several  &'s 
may  be  strung  together. 

See  Section  6.1.6 

evaluates  II,  returns  1  if  it 
is  f  0.  Otherwise  evaluates  12, 
returns  1  if  it  is  0,  otherwise 
return.  0. 


II  ANP  12  evaluates  II,  returns  0  if  it  is 

«0.  Otherwise  evaluates  12,  returns 
0  if  it  is  =  0,  otherwise  returns  1. 

NOT  I  evaluates  I  and  returns  1  if  I  = 

0,  otherwise  i^eturns  0. 


A1  (  =  ,#,>,>=, <,<=)A2 


A1  ^  A2 


Ml  MOD  M2 


*evaluates  Ai  and  A2  and  then 
evaluates  the  relation.  The  value 
is  0  if  the  relation  does  not  hold, 

1  if  it  does.  Note  that  only  = 
and  #  are  legal  on  non-M  types. 

evaluates  A2  and  stores  the  resulting 
value  into  Al .  "^hey  must  agree  in  type 
and  mode  except  for  the  special  treatment 
of  constants ,  and  except  that  one  operand 
may  be  of  type  U. 

*evaluates  Ml  and  M2,  and  returns 
M1-FIX(M1/M2)*M2 


Nl(+, -,*,/)  N2  *obvious 

I1(A' ,V' ,E' )I2  *compute  the  bitwise  "and,"  "or,"  or 

"exclusive-or"  of  the  operands 

I1(LSH,RSH,LCY,RCY)  12  Hhese  are  24-bit  logical  shifts 

(shift  tn  0s)  or  cycles 


*  see  end  of  table. 


N1  **  N2 


(+,-)N 
N'  I 

GOTO  L 


I.  AF 


A  $  IF 


I  @  IF 


♦obvious,  except  that 
II  **12  is  an  error  unless  12  is 
positive.  The  error  is  not  caught 
until  runtime  if  12  is  not  a  constant. 

obvious . 

computes  the  bitwise  (I's)  complement 
of  I. 

sends  control  to  the  statement 
labeled  by  L.  If  this  was  passed 
as  a  parameter,  the  correct  environment 
is  restored. 

♦evaluates  I,  takes  it  as  an 
absolute  address  A,  and  references 
the  bits  of  A  +  word :displacement 
(AF),  from  starting: bit  (AF)  to 
ending:bit  (aF).  The  tesult  may 
appear  on  either  side  of  an 
assignment.  If  the  field  is 
SIGNED,  the  starting  bit  is  copied 
into  all  the  higher  bit  positions 
when  the  result  is  used  as  a 
value;  otherwise  these  positions 
are  filled  with  zeros. 

♦references  the  bits  of  the  value  of 
A  specified  by  IF.  The  word 
displacement  of  IF  should  not  be 
greater  than  the  number  of  words 
in  the  value  of  A.  (4  at  most  if 
A  is  a  variable).  Sign  extension 
is  handled  as  for  above. 

♦returns  T,  where  T  is  the  result 
of: 


T  ^  0 
T$IF  ^  I 

i.e.,  the  valie  of  I  positioned 
in  a  word  according  to  the  field 
IF. 


see  end  of  table. 
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$I 


@A 


AY[I . I] 

IS[I] 


A1(A2....,A2) 

AF(I) 


references  the  value  addressed  by 
the  value  of  I  taken  as  a  hardware 
indirect  word.  Normally  the  top 
6  bits  of  I  should  be  off,  since 
the  hardware  uses  them  to  select 
the  type  of  indirection  rather 
than  to  specify  the  address. 

The  value  is  of  type  U. 

returns  the  address  of  the  value 
of  A.  It  makes  sense  (and  is 
legal)  only  if  A  can  appear  on 
the  left  of  or  is  a  label  constant. 

references  the  element  of  the  array 
A  specified  by  the  subscript  I,  as 
described  in  Section  6.1.5.  If 
the  first  operand  is  IS,  only  1 
subscript  is  allowed.  This  construct 
is  equivalent  to  (IS  +  I).W0,  where 
we  have  declared  FIELD  W0(0). 

returns  the  value  of  the  function 
A1  after  calling  it  with  parameters 
A2,  as  described  in  Section  6.1.6. 

equivalent  to  I.AF 


An  *  preceding  the  description  means  that  the  order  of 
evaluation  of  simple  operands  (see  Section  6. 1.6.1)  is 
undefined.  Compound  operands  are  always  evaluated  left-to-right 
if  there  are  more  than  one.  If  the  *  is  lacking,  the 
operands  are  always  evaluated  left-to-right. 


6.1.5  Array  expressions 


Arrays  of  any  dimensionality  from  1  to  7  are  allowed.  If 
the  array  has  n  dimensions,  then  a  reference  to  it  with  n 
integer  subscripts  yields  a  scalar  of  the  same  type,  unless 
the  type  is  ARRAY,  FIELD  or  FUNCTION.  In  this  case  the  type 
of  the  result  is  UNKNOWN  and  its  mode  is  the  type  of  the 
array.  Thus,  after  declaring: 

1)  INTEGER  I,  c,  K,  REAL  ARRAY  A[3,  4,  5] 
we  know  that: 

2)  A  [I.  J+1,  K**2] 

is  a  REAL  SCALAR.  It  is  also  possible  to  write: 

3)  A  [I,  J+1] 

which  is  an  UNKNOWN  ARRAY.  If  it  is  assigned  to  the  REAL 
ARRAY  B,  then: 

4)  B  [K**2] 

t 

references  the  same  scalar  referenced  by  (2).  It  is  probably 
not  useful  to  do  anything  with  an  UNKNOWN  array  except  to 
assign  it  to  something. 

Marginal  indexing  is  used  to  access  arrays.  In  the  above  example  (1), 
the  value  of  A  is  a  descriptor  for  an  array  with  three 
entries.  Each  entry  of  this  array  is  a  descriptor  for  an  array 
with  four  entries.  Each  entry  of  the  four-entry  array  is  a  descriptor 
for  an  array  with  five  entries,  each  of  which  is  a  real  number. 

The  Figure  6.1.5  illustrates  this.  The  120  words  allocated  for  the  real 
numbers  are  contiguous  in  storage  and  in  the  order  indicated. 

Note  that  Fortran  arrays  vary  the  first  subscript  first 

and  are  therefore  incompatible. 
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6.1.6  Function  call  and  return  expressions 

The  syntax  for  a  function  call  is 

a:primary  "("  [expr  expr)]  [stores] 

["//"  fai lure:resuU  [stores]]")"  ; 
stores  =  identifier  identifier)  ; 

failurerresult  =  ["GOTO"]  identifier  /  ("RETURN"  / 
"FRETURN")  [expr  /  expr: list]  /  "VALUE"  expr  ; 

The  a: primary  must  have  mode=FUNCTION.  The  value  of  the 
function  is  taken  to  be  a  SCALAR  of  type  equal  to  the  type 
of  the  a:primary,  unless  this  type  is  ARRAY,  FIELD  or 
FUNCTION.  In  the  latter  case,  the  type  of  the  result  is  UNKNOWN 
and  its  mode  is  given  by  the  type  of  the  function. 

6. 1 .6. 1  Arguments 

The  arguments  immediately  follow  the  function  name.  There 
is  no  restriction  on  their  number  or  type,  except  that  an 
initialized  LOCAL  label  or  string  array  may  not  be  used. 

F();  F(X);  F(X,Y(1,2),Z[3]**5,W,Q); 
are  function  calls  with  0,  1  and  5  arguments  respectively. 
Arguments  are  evaluated  as  follows.  All  the  arguments  which 
are  compound  are  evaluated,  left  to  right,  and  their  values 
are  saved.  An  argument  is  simple  if  it  is  one  of  the 
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following,  compound  otherwise: 

constant 

identifier 

identifier  "["  identifier  "]" 
identifier  field 
"$"  identifier 

"S"  identifier  full-word  field 

The  value  of  each  argument  is  then  stored  in  the  corresponding 
formal  argument  of  the  function  being  called.  No  type 
conversion  is  done;  nonmatched  types  are  a  run-time  error. 
Control  is  then  passed  to  the  function. 

6 . 1 . 6 . 2  Returns 


A  return  is  done  with  an  expression  of  the  form: 

RETURN  (expr,  expr,...,  expr)/ 

RETURN  expr/ 

RETURN; 

The  expression  list  is  treated  exactly  like  the  actual 
parameter  list  of  a  function  call.  The  value  of  the  first 
expression  becomes  the  value  of  the  function;  it  and  subsequent 
values  are  stored  in  the  corresponding  identifiers 
following  the  in  the  call,  exactly  as  actual  parameter 
values  are  stored  in  formal  parameters. 
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6. 1.6. 3  Failure  Exits 


If  a  failure  exit  is  provided  following  the  "//"  in  the  call, 
an  FRETURN  will  send  control  as  specified  in  the  failure  exit. 

It  may  be  a  label,  in  which  case  control  goes  to  that  address;  a 
RETURN,  in  which  case  a  return  is  made  from  the  function  containing 
the  failure  exit;  or  a  VALUE  expression,  in  which  case  the  value  of  the 
expression  becomes  the  value  of  the  function.  Just  as  for  the 
RETURN,  any  number  of  values  may  be  returned;  they  are  stored 
in  the  corresponding  local :identifiers  following  the 
When  a  function  has  a  failure  exit  the  normal  or  success  return 
is  with  RETURN,  exactly  as  for  a  function  with  no  error  exit. 

6.2  Expressions  used  as  Statements 

In  order  to  catch  some  common  errors  in  which  the  user  inadvertently 
writes  an  expression  statement  which  does  nothing, 
a  set  of  rules  is  enforced.  They  insure  that  evaluation  of 
the  expression  results  in  some  change  in  the  state  of  the 
world;  such  an  expression  is  called  an  action  expression. 

Here  the  principal  operator  is  the  one  of  lowest  precedence 
(i.e.  first  on  the  list  in  the  Section  6.1.1  on  "Precedence  of 
Operators"),  except  that  any  operator  enclosed  in  n  sets  of 
parentheses  is  of  higher  precedence  than  any  operator  enclosed 
in  fewer  than  n  sets  of  parentheses. 
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An  expression  is  an  action  expression  if  the  principal  operator  is: 

a)  GOTO,  RETURN,  FRETURN,  or  a  function  call 

b)  WHERE,  &,  FOR,  WHILE,  IF 

If  the  operator  is  in  group  (b)  then: 

a)  for  "WHERE"  or  both  operands  are  action 
expressions, 

b)  for  "FOR"  or  "WHILE",  the  body  (first  operand) 
is  an  action  expression, 

c)  for  "IF/ELSE",  both  of  the  consequents  are 
action  expressions,  or  the  second  consequent 
is  missing. 

6.2.1  IF  statements 

We  have  seen  that  IF  can  be  used  as  an  operator.  It  can 

also  be  used  in  the  following  way: 

IF  expression  DO; 


ELSEIF  expression  DO; 
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ELSE  DO; 

ENDIF; 

Any  number  of  ELSEIFs  are  allowed.  The  may  be  replaced 

by  any  sequence  of  statements  balanced  with  respect  to  IFs 
and  FORs.  The  ELSE  may  be  omitted.  The  integer  expressions  after 
the  IF  and  ELSEIFs  are  evaluated  in  turn  until  a  non-zero  one  is 
found.  The  statements  between  it  and  the  next  ELSEIF,  ELSE  or  ENDIF 
are  then  executed,  and  control  goes  to  the  statement  following  the 
ENDIF.  THE  ELSE  DO  is  equivalent  to  ELSEIF  1  DO.  If  none  of 
the  expressions  are  non-zero,  nothing  is  done.  It  is  good 
practice  to  indent  the  statements  represented  by  uniformly 

3  or  4  spaces  for  clarity. 

6.2.2  FOR  statements 


The  same  thing  can  be  done  with  "FOR"  as  with  "IF" 
for: statement; 

ENDFOR; 

Here  we  have 

for:statement  =  ("FOR"  for:clause  /  "WHILE"  expression) "DO" ; 
foriclause  =  identifier  "-f-"  (expression! ["BY"  expression2] 
["TO"  express! on3]  /  expression! [" ,"  expression2] 

WHILE"  expression3) ; 


64 


If  the  BY/TO  form  is  used,  the  identifier  must  be  of  type  M. 

If  BY  is  omitted,  BV  1  is  assumed.  If  TO  is  omitted  the  loop 
can  only  terminate  by  an  explicit  transfer  out. 

The  effect  is  that  the  statements  represented  by  . . .  are  executed 
repeatedly  for  successive  values  of  the  controlled  variable. 

In  the  first  case  the  variable  starts  at  expression!. 

On  each  successive  loop  expression2  is  added  until  the 
variable  passes  beyond  expressions.  The  definition  of 
"beyond"  depends  on  the  sign  of  expressions.  If  expression! 
is  beyond  expressions,  the  loop  body  will  not  be 

executed  at  all.  If  the  expressions  are  compound  (see  Section  6.1.6  on 
"Function  Calls"),  they  are  evaluated  before  the  loop  starts; 
if  simple,  then  each  time  around. 

The  second  form  initializes  the  controlled  variable  for 
expression!.  Then  it  tests  integer  expressions.  If  it  is 
0,  control  passes  beyord  the  ENDFOR.  Otherwise  the  loop 
body  is  executed,  the  value  of  expressions  (or  expression! 
if  expressions  is  omitted)  is  assigned  to  the  variable,  and 
the  test  is  made  again.  The  expressions  are  re-evaluated 
each  time  around  the  loop. 

A  WHILE  statement  simply  loops  until  the  integer  expression 
is  0,  without  modifying  anything. 

When  "FOR"  or  "WHILE"  is  used  as  an  operator,  exactly  the  same 
facilities  are  provided.  The  first  argument  is  evaluated 
each  time  around  the  loop.  The  value  is  undefined.  Thus 
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A[I,J]  ^  9  FOR  W  TO  N  FOR  TO  M; 
would  initialize  the  array  A. 

6.3  Assembly  Language 

An  assembler:statement  consists  of  one  or  more  machine  instructions 
according  to  the  following  syntax: 


assembler: statement 

=  machine  .'instruction  $('," 

["."]  machine:instruction) ; 

machine:instruction 

=  opcode  [address]; 

opcode 

=  identifier  /  simple: integer; 

address 

=  expression; 

Since  opcodes  apnearin  a  restricted  context,  the  symbols  used 
for  opcodes  in  the  CPU  Manual  (which  are  all  recognized  by  SPL) 
may  be  used  freely  for  other  purposes  as  well.  If  an  opcode 
is  an  identifier  and  not  predefined,  it  must  be  an  INTEGER 
constant.  Such  opcodes,  as  well  as  opcodes  which  are  written 
as  integers,  are  treated  as  follows:  if  no  address  appears,  the 
value  of  the  opcode  is  placed  directly  in  the  compiled  program; 
if  an  address  does  appear,  bits  18-23  of  the  opcode 
value  are  placed  in  bits  3-8  of  the  instruction  word  and  bit 
17  of  the  value  is  placed  in  bit  9  (the  programmed  operator 
bit) . 

Any  expression  may  appear  as  an  address  as  long  as  it  is 
logically  equivalent  to  either  a  constant  (of  any  type  and 
mode)  or  one  of  the  addressing  formats  of  the  CPU.  These 
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formats  are  described  in  oetail  in  the  CPU  Manual  and  are  listed 
below,  together  with  the  usual  way  of  generating  them.  Note 
the  existence  of  the  four  reserved  symbols  X',  L',  G',  and  R'. 


Addressing  format 


Normal  syntax 


D 

G'[N] 

I 

$G'[N] 

X 

X'[N] 

PD 

P[N] 

PDI 

$P[N] 

BX 

A[I] 

BXD 

($X')[I+N] 

IM 

N 

IMX 

X'+N 

SR 

R’[N] 

SRI 

$R'[N] 

LR 

L'[N] 

LRI 

$L-[N] 

In  the  above  list,  N  stands  for  an  INTEGER  constant  quantity, 
P  and  I  stand  for  INTEGER  SCALAR  quantities,  and  A  stands  for 
an  ARRAY  quantity,  BX  or  PD  addressi  ig  may  also  result  from 
tailing.  Since  the  determination  of  the  addressing  format 
is  done  on  semantic,  not  syntactic,  grounds,  the  exact 
rules  are  quite  complex. 
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7.  Intrinsic  Functions 


Figure  7.1  lists  all  the  intrinsic  functions  in  SPL. 

An  intrinsic  function  is  one  which: 

1. )  Is  recognized  by  the  compiler  without  the  need 
for  any  declaration  by  the  user; 

2. )  May  have  default  argument  values  automatically 
supplied  by  the  compiler; 

3. )  Has  the  types  of  its  arguments  checked  at  compile 

time; 

4. )  May  compile  into  special  open  code. 

In  figure  7.1,  default  values  for  arguments  which  the  user 
is  allowed  to  omit  are  given  in  parentheses  after  the 
argument  type.  For  all  functions  which  have  failure  returns,  a 
routine  which  prints  an  error  message  and  causes  a  sub¬ 
process  trap  will  be  supplied  if  the  user  fails  to  specify 
a  failure  action. 

The  remainder  of  this  section  describes  each  intrinsic 
function  in  detail.  Type  letters  with  subscripts 
will  be  used  to  refer  to  the  arguments  of  a  function: 
e.g.  the  arguments  of  CNS  will  be  referred  to  as  11,52,13, 
and  I^. 


68 


m 

u 


I-  . 

I  ' 


I 


h 


pt 

I 


I 


fv 


NAME  ARGUMENT  TYPES  RESULT  TYPE  FRETURN?  OPEN  CODE? 


FIX 

R 

I 

ENTIER 

R 

I 

FLOAT 

I 

R 

DFLOAT 

I 

D 

RE 

C 

R 

IM 

c 

R 

CSN 

s.l(10) 

I 

X 

CSR 

s 

R 

X 

CSD 

s 

D 

X 

CNS 

i.s.K  0  ).i(i0) 

S 

X 

CRS 

R,S.I(0) 

s 

X 

CD: 

O.S,I(0) 

s 

X 

INCOES 

I, I 

I 

LNGOES 

M 

I 

"Cl 

S 

I 

X 

WCI 

I.S 

I 

X 

GCD 

s 

I 

X 

.'CD 

I,s 

I 

X 

SETUP 

s. 1, 1, 1(8) 

s 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 

X 


I  =  integer 

C  = 

complex 

R  =  real 

A  = 

array 

S  =  string 

D 

jub  le 

I 


Figure  7.1  List  of  intrinsic  functions 
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NAME 

ARGUMENT  TYPES 

RESULT  TYPE 

FRETURN? 

OPEN  CODE? 

SETS 

S. 1(0). 1(0) 

S 

X 

SETR 

S.I(0) 

S 

X 

SETW 

S.I(0) 

S 

X 

LENGTH 

S 

I 

X 

SCOPY 

S.S 

S 

X 

APPEND 

S.S 

S 

X 

GC 

S 

I 

X 

STORINIT 

1. 1 

I 

X 

MAKE 

1.1(0) 

I 

X 

SETZONE 

I 

I 

X 

SETARRAY 

A 

I 

X 

FREE 

1.1(0) 

- 

X 

EXTZONE 

1. 1 

- 

X 

FREEZONE 

1.1(0) 

- 

X 

BCOPY 

i.i.K-i) 

- 

X 

BSET 

i.i.K-i) 

- 

X 

I  =  integer 

C  =  complex 

R  =  real 

A  =  array 

S  =  string 

D  =  double 

Figure  7.1  (continued) 
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7.1  Type  Conversion  Functions 


FIXlR-j)  converts  R-j  to  an  integer  by  truncation  towards 

zero. 

ENTIER(R^)  converts  R^  to  the  nearest  integer. 

FLOAT(Ii)  converts  I-j  to  single-precision  floating  point. 

DFLOAT(I^)  converts  I-j  to  double-precision  floating  point. 

The  four  operators  above  are  converted  directly  into  machine 
instructions.  For  details  consult  the  part  of  the  CPU 
Manual  which  deals  with  handling  of  floating 
point  numbers. 

RE(C^)  gives  the  real  part  of  C.j  in  single-precision 
floating  point. 

IM(C^)  gives  the  imaginary  part  of  C-j  in  single-precision 
floating  point. 

CSN(Spl2//F)  expects  to  find  an  integer  as  the  beginning 
of  S^,  with  syntax  ['+‘  /  l$<digit  in  base  l^>. 

Digits  above  9  are  allowed  if  l2>10:  the  next  digit  after 
9  is  A,  and  so  on.  I2  is  taken  as  10  if  not  supplied.  CSN 
returns  the  integer,  which  it  reads  off  the  string,  advancing 
the  reader  pointer  so  that  the  next  character  to  be  read 
is  the  non-digit  which  ends  the  integer,  or  to 
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the  end  of  the  string.  CSN  fails  if  S-j  does  not  begin 
with  an  integer  in  the  proper  format,  leaving  the  reader 
pointer  unchanged. 

CSR(Si//F)  expects  to  find,  at  the  beginning  of  S-j, 
a  real  number  in  any  of  the  formats  allowed  by  SPL  for  REAL 
quantities.  It  returns  a  single-precision  floating  point 
number.  Otherwise  the  action  is  the  same  as  for  CSN. 

CSD(S^//F)  is  the  same  as  CSR  except  that  it  returns 
a  double-precision  floating  point  result.  Either  SPL 
REAL  or  DOUBLE  syntax  is  acceptable;  in  the  former  case, 
the  number  is  accumulated  in  double  precision. 

CNS(I^  ,52,13, I^//F)  converts  the  value  of  I-j  to  a 
string  of  characters,  which  it  appends  to  S2.  The  radix 
is  I^,  assumed  to  be  10  if  omitted.  If  bit  0  of  I3  is  on, 

I-j  is  converted  unsigned  (e.g.  -2  will  appear  as  77777776 
in  radix  8);  otherwise,  a  precedes  the  converted  absolute 
value  if  I-j  is  negative.  Bits  18-23  of  I3  give  the 
number  of  characters  to  generate:  enough  blanks  are  written 
before  the  converted  value  to  give  the  total  number  of 
characters  required.  If  the  converted  value  does  not  fit 
into  this  many  characters,  it  is  truncated  on  the  left  with 
no  error  indication.  If  the  character  count  is  0,  the  converted 
value  is  neither  padded  nor  truncated.  I^  is  taken  as  0  (signed, 
no  formatting)  if  omitted.  CNS  fails  only  if  there  is  insufficient 
room  to  write  the  necessary  number  of  characters  onto  S2-'  leaving 
the  writer  pointer  is  unaffected. 
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CRS(R^ ,S2il3//F)  appends  the  converted  value  of 
R^  to  $2.  The  failure  condition  is  the  same  as  for  CNS. 
I3  specifies  the  format  in  some  as  yet  unspecified  way; 

I3  =  0  is  assumed  if  is  omitted,  results  in  some 
reasonable  unformatted  conversion. 

CDS(Di ,S2.l3//F)  is  exactly  like  CRS  except  that  the 
converted  value  is  in  DOUBLE  rather  than  REAL  format. 


String  Functions 


In  this  section  the  following  abbreviations  are  used: 

BP  =  beginning  pointer,  RP  =  reader  pointer,  WP  =  writer 
pointer,  EP  =  end  pointer.  These  correspond  to  the  4 
words  of  an  SPL  string  descriptor,  in  order. 


INCDES( I-j  ,12)  assumes  that  is  a  character  pointer 
(hardware  string  indirect  word),  such  as  one  of  the  4 
words  in  an  SPL  string  descriptor.  The  value  is  I-j 
incremented  by  I2  character  positions.  See  CPU  Manual 
for  the  exact  specification  of  this  operation,  which  is 
done  with  the  ASP  instruction. 


LNGDES(  I-j  ,12)  assumes  that  and  I2  are  both 
character  pointers.  It  returns  the  length  of  the  string 
which  they  bracket.  Set  the  CLS  instruction  in  CPU  Manual 


for  details. 
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GCI(Si//F)  fails  if  is  empty,  i.e.  RP  =  WP. 

Otherwise  it  increments  RP  by  one  chara'ter  position  then 
returns  the  character  pointed  to  by  RP. 

WCI(I^,S2//F)  fails  if  $2  is  full,  i.e.  WP  =  EP. 

Otherwise  it  increments  WP  then  writes  I-j  at  the  character  position 
pointed  to  by  WP.  The  value  of  WCI  is  I-j. 

GCD(Si//F)  fails  if  S-j  is  empty.  Otherwise  it  returns  the  character 
pointed  to  by  WP  and  then  decrements  WP. 

WCD( ,S2//F)  fails  if  $2  is  initialized,  i.e.  BP  =  RP. 

Otherwise  it  writes  I-j  at  the  character  position  pointed  to  by 
RP  and  then  decrements  RP.  The  value  of  WCD  is  I-j. 

SETUP(S^ ,12,1314)  puts  into  $1  a  string  descriptor  for 
a  string  of  I2  characters  starting  with  the  first  character  of 
the  word  pointed  to  by  I3.  The  character  size  is  I4,  defaulting  to 
8.  The  value  of  SETUP  is  the  string  descriptor  it  creates.  If 
is  omitted,  MAKE  is  called  to  assign  space.  BP  =  RP  =  WP  in  the 
created  string  descriptor. 

SETS(S.|  ,12,13)  is  exactly  equivalent  to  SETW(S.|,l2) 
followed  by  SETR(S.|  ,12) :  see  below.  I2  and  are  taken 
as  0  if  omitted. 
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SETR(Si,l2)  sets  S-j's  RP  to  point  I2  characters  beyond 
BP.  IF  l2<0.  it  is  taken  as  0;  if  l2>LNGDES(BP,WP) ,  it  is 
taken  as  this  quantity;  if  1 2  is  omitted,  it  is  taken  as  0. 

The  effect  is  that  the  RP  remains  between  the  BP  and  the  WP. 

SETW(Si,l2)  sets  S-j's  WP  to  point  I2  characters  beyond 
BP.  There  are  four  cases:  l2<0  leads  to  WP-<-RP-»-BP ;  0£l2 
<LNGDES(BP,RP)  leads  to  WP^RP^INCDES(BP,l2) ;  LNGDES(BP,RP) 
<l2<LMaDES(BP,EP)  leads  to  WP^INCDES(BP,l2) ;  and  l2>LNGDES 
(BP,EP)  leads  to  WP-^-EP.  Again,  the  effect  is  to  guarantee 
the  correct  order  of  BP,  RP,  WP,  and  EP. 

LENGTH(Si)  gives  the  number  of  GCI's  that  can  be  done 
on  $1  without  failing,  i.e.  LNGDES(RP,WP) . 

GC(S^)  returns  the  character  pointed  to  by  RP.  This 
is  garbage  if  S-j  is  empty,  but  no  check  is  made. 

SC0PY(Si  ,S2//F)  copies  the  string  $2  into  the  string 
S-j.  $2  is  not  affected;  for  ,  RP-^BP  and  WP-^INCDES 
(BP,  LENGTH(S2)).  Failure  occurs  only  if  there  is  not  enough  room 
in  S-j  and  no  pointers  are  affected. 

APPEND(Si  ,82)  appends  the  string  $2  to  the  string  S.| , 
advancing  S.|'s  WP  by  LENGTH(S2).  The  failure  condition  is  the 


same  as  for  SCOPY. 
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7.3  Storage  Allocation  Functions 

There  is  a  standard  mechanism  for  allocating  and  releasing 
arbitrary-sized  blocks  of  storage  in  arbitrary  o>"der  called 
the  storage  allocator.  It  is  driven  by  the  following 
standard  functions: 

STORINIT  (I 1,12)  initializes  the  storage  allocator 
to  use  an  area  of  storage  beginning  at  I-j  and  occupying  I2 
number  of  words  for  its  machinations.  It  is  not  necessary 
to  call  STORINIT;  a  standard  area  will  be  reserved  if 
STORINIT  has  not  been  called  when  the  first  request  is  made 
for  a  block.  The  value  of  STORINIT  is  a  pointer  to  the  zone 
just  created;  this  pointer  is  also  put  into  the  predeclared 
global  pointer  variables  INFINITY 'ZONE  and  CURRENT 'ZONE. 

MAKE(I^,l2)  creates  a  block  of  storage  of  I-j  words 
and  returns  a  pointer  to  it.  An  extra  cell  is  assigned  by 
the  system;  it  immediately  precedes  the  block  and  contains 
the  length  in  the  bottom  18  bits  and  flags  in  the  top  6. 

The  cell  is  for  system  use  only.  Space  is  normally 
allocated  directly  from  the  area  specified  by  STORINIT 
(or  the  standard  default  area);  this  area  is  called  the 
infinity  zone.  The  user  may  set  up  zones  of  his  own;  for 
example,  if  he  wishes  10  create  some  fairly  complex 
temporary  structure  and  then  delete  it  in  its  entirety, 
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it  is  more  efficient  to  create  it  in  a  separate  zone 

and  then  release  the  entire  zone.  I2>  which  is  optional, 

is  a  pointer  to  a  user  zone;  if  it  is  omitted,  the  zone 

pointed  to  by  CURRENT 'ZONE  is  used.  CURRENT 'ZONE  is 

set  by  the  function  SETZONE(Ii)  which  provides  compatibility 

with  the  (hardware)  storage  allocator.  A  user  zone  is 

created  by  the  function  STORINIT. 

SETARRAY(A^ )  which  makes  the  space  occupied  by  the 
array  A^  into  a  new  zone  by  setting  up  some  machinery 
inside  it.  CURRENT 'ZONE  is  not  set  by  this  function. 

Blocks  are  released  by  the  function  FREE. 

FREE(I^,l2)  where  the  block  pointed  to  by  must 
fall  within  the  zone  optionally  given  by  I2.  When  a 
zone  is  full,  i.e.,  a  call  on  MAKE  finds  insufficient 
space,  an  overflow  function  is  executed.  The  address  of 
the  descriptor  for  this  function  is  in  word  1  of  the  zone; 
it  is  initialized  to  a  system  error  routine  when  the  zone  is 
created.  The  user,  of  course,  may  change  it  at  any  time. 

The  function  receives  the  arguments  of  MAKE  as  its  arguments. 
Frequently  the  proper  course  of  action  is  to  acquire  additional 
space  and  attach  it  to  the  zone:  this  is  done  by  EXTZONE. 

EXTZ0NE(I.|  ,12)  adds  all  the  space  in  the  block 
1 2  to  the  zone  pointed  to  by  I.|.  When  a  zone  reaches  the 
end  of  its  usefulness,  all  the  space  occupied  by  the 
zone  must  be  released  using  the  function  FREEZONE. 
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FRr.EZONE(Ii ,12)  releases  all  the  space  (Including 
extra  extensions)  occupied  by  the  zone  into  the  zone 
I2.  If  the  extensions  were  allocated  out  of  more  than 
one  zone,  the  user  must  release  them  individually  with  FREE. 

7.4  Miscellaneous  Functions 


BC0PY(Ii ,12,13)  copies  words  starting  at  I2  to 
words  starting  at  I-j.  Copying  is  done  in  the  appropriate 
direction  (i.e.  starting  at  the  beginning  or  the  end 
of  the  block)  to  ensure  that  no  information  is  lost.  If 
I^  is  omitted,  I2.SIZE  is  used,  where  FIELD  SIZE  (-1:6,23); 
this  is  where  the  storage  allocator  hides  the  block  size. 

The  intention  is  that  I3  should  be  omitted  if  the  block 
pointed  to  by  I2  was  created  with  MAKE,  since  other  objects 
in  SPL  such  as  arrays  and  strings  do  not  have  this  word. 

BSET(  I.|  ,12,13)  initializes  1 3  words  starting  at  I.| 
to  the  value  I2.  If  I3  is  omitted,  I^j.SIZE  is  used  as  in 


BCOPY. 


Appendix  I 
1.1  St  of  Keywords 


A* 

AND 

ARRAY 

ARRAYONE 

BY 

CHARACTER 

COMMON 

COMPLEX 

DECLARE 

DO 

DOUBLE 

E' 

ELSE 

ELSEIF 

END 

ENDFOR 

ENDIF 

ENTRY 

EXTERNAL 

FIELD 

FIXED 

FOR 

FRETURN 
FTRAP* ENTRY 
FUNCTION 

G' 

GOTO 

IF 

INCLUDE 

INTEGER 

L* 

LABEL 

LCY 

LONG 

LONGLONG 

LSH 


MACRO 

MOD 

MONITOR 

N' 

NOT 

OCTAL 

OR 

ORIGIN 

PARAMETER 

POINTER 

POP 

PROGRAM 

R' 

RCY 

REAL 

RETURN 

RSH 

SCALAR 
SIGNED 
SP' ENTRY 
STRING 
SYSPOP 

TO 

TRAP 'ENTRY 

UNKNOWN 

UTILITY 

V 

VALUE 

WHERE 

WHILE 

X' 


